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ABSTRACT 


The partition problem is that step in the layout problem 
in which it must be decided which of the elementary digital 
circuits are to be coalesced into a single, electronic 
package. A solution of the partition problem must satisfy 
constraints on the maximum number of elementary circuits that 
can be put into a single package, and on the number of exter- 
nal connections that can be attached to the package. 

A solution due to Lawler et al [Ref. 12], which minimizes 
delay caused by clustering electronic elements, is extended 
to cyclic networks. A new algorithm to extract from a graph 
the maximal strongly-connected subgraphs (lobes) is developed, 
and a new approach to clustering the digital elements of a 
lobe is presented. The digital circuit is represented by a 
bipartite graph, and solutions are expressed in terms of 


graph theory. 
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i. JANTRODUCTION 


ee LOE. DESIGN BROCE 

The design of a digital electronic circuit takes place 
in several stages. In the first stage the designer determines 
precisely the relations between the inputs to the digital 
circuit and the deSired outputs. Based on this analysis, and 
dependent also on certain physical characteristics of the 
electronic medium that is used, the fundamental logic set of 
the digital circuit is defined. In the fundamental logic set 
are groups of elementary circuit elements such as "or," "and" 
or "nor" gates, combined in small electronic packages such as 
miniturized integrated circuits or semiconductor chips (sub- 
strates). Members of the fundamental logic set are the most 
basic replaceable logic elements of the digital circuit, and 
1t is from these that the digital circuit is built. 

Based also on the analysis of inputs and desired outputs 
is the determination of the specifications of the digital 
circuit. After the specifications are complete, the synthesis 
of the digital circuit takes place. Following the specifica- 
tions, elements of the fundamental logic set are interconnected 
to form a representation of a digital circuit. One of the 
prime objectives of the logic designer is simplicity. 

The logic design 1S produced relatively independently of 
the physical structure of the digital circuit. The engineer 


takes as his input the results of the logic designer, and 








produces the digital circuit according to the phy saeameane 
electronic characteristics of the circuit elements and their 
interconnections. 

Kodres [Ref. 10] refers to that step of the logic design 
process, in which it must be determined how elementary logic 
elements are to be interconnected, ae (ene layout problem. 

The layout problem is concerned with such questions as how 
many electronic elements can be placed ina Single electronic 
package, in what relative locations they should be placed 
within the package, and in what manner can several packages be 
interconnected. The layout problem is dependent on physical 
properties such as the interconnection of electronic compo- 
nents, the number of electronic components or the number of 
input or output signals at a functional component. 

The layout problem can be applied to the design of digi- 
Gal Circuits at any of several levels of increasing scope. 
For example, the results of the theory developed in general 
terms can be applied with little modification to the design 
of members of the fundamental logic set and the layout of 
gates within such a small package. Application can also be 
made to the clustering of fundamental logic set elements with- 


in larger electronic modules. 


5B. THE PARTITION PROBI=M 
The partition problem is that step in the layout problem 
in which it must be decided which of the elementary circuits 


are to be combined into a single electronic package. The 








solution of the partition problem must satisfy a number of 
constraints. These constraints are divided into two cate- 
gories: packaging constraints and performance constraints. 

Most important to the designer of packages is to keep 
within space limitations. Thus the number of elementary 
circuits in a single package is limited by their very size. 
Closely associated with the problem of limited space is a 
Limit on the number of external connections that are possible 
for a package of a certain size. 

The limit on the number of external connections is more 
or less critical depending on the application of the parti- 
tion problem. In the case of a semiconductor chip, the 
electronic elements are placed on the chip chemically. These 
elements are extremely small. In comparison, the external 
connections to the chip are physically attached to the package 
that contains the chip by means of solder connections, and are 
thus an order of magnitude Mareer than the internal logic 
elements. In this case, the external connection constraint 
1S very critical. 

If the electronic package is the size of a replaceable 
module, or circuit card, in a computer, then the external 
connection constraint is less critical. In this case the 
external connections are in the form of small pins that line 
one edge of the card, and that are plugged into the computer's 
mee rotruccure. 

Packaging constraints are due to physical and economic 


considerations. There is, however, an additional constraint 








that 1s due to performance characteristics. It is desirable 
to reduce the amount of electronic delay through the network. 
Delay is defined in terms of a function proportional to the 
total wire length, or the wire length in the longest closed 
path, or the wire length from an input of a network to an 
output. 

The problem at hand, which is the variant of the partition 
problem that Lawler et al [Ref. 12] considered, is a minimi- 
zation problem. Given a digital circuit specified by the 
logic design, assume that a maximum of M elementary functional 
circuits can be accommodated in an electronic package, and 
that a maximum of P external connections can be accommodated. 
Assume also that in a resultant network of completed electron- 
ic packages, no delay is encountered for interconnections 
within an electronic package, and that a delay of one time 
unit 1s encountered for connections between packages. Find 
an efficient “algorithm that will result in a network such 


that the maximum delay through the network is minimized. "+ 


foe tHe GOALS OF THIS PAPER 
Lawler et al [Ref. 12] discovered efficient, easily 
applied algorithms to produce a solution of the partition 


problem for the case in which the digital circuit is in the 


Mawler, Helin. Levitt, Kee Ne,sdhnad LUuLrner ju .,5 Module 
Clustering to Minimize Delay in Digital Networks," IEEE 
Transactions on Computers, v. C-18, p. 48, January 1969. 





form of a tree, and also for the more general case in which 
the digital circuit 1s in the form of a directed sapeaene iene 
out cycles, 1.e., without feedback. 

The general problem, in which the digital circuit can 
have feedback, is considered in this paper. First a new 
method of representing a digital) circuit in terms sot abr 
partite graph, due to Kodres [Ref. 10], is introduced. Then 
a method of reducing the general partition problem to one 
which can utilize Lawler's results is described. Finally a 
description of the general procedure in terms of the bipartite 
representation is given. 

To this end, Chapter II contains the preliminary concepts 
pertinent to the problem. The formal definition of the pro- 


blem and the solution are given in Sections III, IV, and V. 








II. APPLICATION OF GRAPHY THEORY 


Modern computer circuitry 1s produced using the techno- 
logies of printed circuits and integrated circuits. Both 
these forms of manufacturing restrict the physical intercon- 
nection of elements to a planar surface. This construction 
suggests the use of graph-like structures which are embedded 
in the plane to study the physical properties of the digital 
eamecuits. 

In this section is presented a basic description of graph 
theory relevant to the representation and analysis of these 
physical properties. The terminology and definitions of graph 
theory vary greatly among authors. The terminology and defi- 
nitions in this paper are heavily influenced by Busacker and 


Saaty [Ref. 3] and by Berge [Ref. l]. 


ree SASLC CONCEPTS IN GRAPH THEORY 
1. Definition of Undirected and Directed Graphs 

In order to define an undirected graph, the concept 
of an unordered product of a set with itself is introduced. 
The symbol S & S denotes the unordered product of a set S 
with itself, and is defined to be the set of all unordered 
pairs (s&t), where s e S and t e S. The symbols (s&t) and 
(t&s) denote the same element in S & S. 

An undirected graph G=(V,E,%) is defined to be a 
nonempty set of vertices (or nodes) V, together with a set of 
edges E disjoint from V, and an incidence mapping ¢ of E 


into V & V. Figure 1 depicts an undirected graph. In this 
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Figure 1 


example V= { Vir Va1VarVygr Vor Ve } and E= { C7 1On PO nrO gree ae 
6 (e,) =(v, &v,) and edge ey 1s said to be incident to verticies 


and V.. 


a 2 


Although undirected graphs are used ee analysis of 
digital circuits, the concept of a directed graph is more 
important to the representation of “drequtal-ecrrcults. A 
directed graph is similar to an undirected graph but with the 
added notion of direction assigned to its edges. Formally, 

a directed graph G=(V,A,A) is a nonempty set of verticies 

(ox nodes) V, together with a-set of arcs A, and a directed 
incidence mapping A of A into V x V. Here V x V denotes the 
set of ordered pairs of V using the conventional set-theory 
cross product notation, and if for an arc a in A and a pair 
of verticies Vi and V5 aes A(a)=(vyw) arc a is said to 
join its initial vertex v, to its terminal vertex V>- 

A directed graph is illustrated in Fig. 2. Here an 
arrowhead is appended to an arc to indicate the notion of 
direction. In this example Afaj)=(vyrV5). and the arrow 


points from the initial vertex v, to the terminal vertex V5: 


1! 











at a 


a 





as 


Vi Uy 


Figure 2 


The symbol a,=(Vy«V5) TSerpeads auae ay joins v, to V5." 
Similarly other terms might be used to describe this relation- 
ship such as arc ay "leaves" or "issues from" vertex vy and 
"enters" or "is directed towards" vertex V5. A directed 
graph G=(V,A,A) 1s usually specified by G=(V,A) when the 
incidence mapping is implicit in a description of the arcs. 

There are many definitions and concepts associated 
with directed graphs that are similar to concepts of undi- 
rected graphs. Because the physical structure of a digital 
Circuit can be easily represented in terms of directed graphs, 
the remainder of this section will deal with directed graphs. 
When necesSary in subsequent sections, undirected terminology 
will be clarified. For convenience, directed graphs will be 
referred to in Harary's [Ref. 7] terminology as digraphs. 

2. The Structure of a Digraph 


Certain terminology is useful in describing the struc- 


eure Of a digraph. If a, = (v.w) and a,=(v,w) then arcs a) and 
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a, are Said/to be strictly paral te paeeet a= (w,v) then ay 


and a, are parallel but not strictlysparal tel -eehe ecmteraon 


in Fig. 3 demonstrates these and subsequent concepts which 


eeceintroduced. Ine rigs 3 ance ay and a, are parallel 


Vi 





Figure 3 
If G=(V,A) is a digraph, then a subgraph of Gis a 


graph G,=(V, ,A,) which satisfies the following conditions: 


G 
Lge Vie and A, SA. 


Deel f awe Ay and a~(v,w) then v € Vy and we Vi: 


Connectivity of a graph is defined in the undirected 
sense. If G=(V,E) is an undirected graph then a chain of G 
is a sequence of edges e 


e, in E together with 


1! oor eins k 
their endpoints in V such that each edge Qe. has one vertex 


in common with the succeeding edge e. and the other vertex 


ee 
in common with the preceding edge e,4: G is said to be 
connected if there exists a chain between every pair of dis- 


tinct vertices. A digraph is said to be connected if its 


associated undirected graph is connected. 
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Most of this paper will be concerned with connected 


digraphs. A graph that is not connected can be partitioned 


into maximal connected subgraphs called components. In Fig. 
3 vertices Vir Vor V3 and Va together with arcs Ayr Agr Agr 
ay and a. form one component, and Ve together with aa forms 


another component. Any result that is proved for a connected 
graph can be applied to each component individually. 

If G=(V,A) 1s a digraph, then a path of G is a sequence 
of distinct arcs together with their endpoints such that the 
terminal vertex of each arc coincides with the initial vertex 
of the succeeding arc. A path is said to be Simple if it 
includes no vertex more than once. In Fig. 3, the sequence 
LO. 


dl 5 
A digraph 1S said to be strongly connected if, for 


Vy 1871 Vn 1 gr Vara rVe forms a path from v 


every pair of distinct vertices v and w, there is a path from 
v to wanda path from w tov. Therefore a strongly connected 
graph is necessarily connected, but not the converse. 

A cycle of a digraph is a path in which the initial 
vertex coincides with the terminal vertex. If a cycle C 
traverses vertices VorVyr Var? trVi eV then C is said to be 
simple if NTT for all i,j=0,°**,n. Again referencing Fig. 
3, arcs A, 1azray and ap form a simple cycle. 

An arc a is said to be positively incident with its 
initial vertex v and negatively incident with its terminal 
vertex w. The positive degree of v, S*(v), is the number of 


arcS positively incident with v, and the negative degree, 
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§ (v)., is Similarly defined. The degree of v, 6(v), is the 


sum §* (v) + 6 (v). 


Pe UoE OF BIPARTIIT&E GRAPHS TO REPRESENT DIGITAL Clee. 

The concept of a directed graph can be used to represent 
any circuit that is made up of interconnected functional 
electronic components. The circuit that is to be represented 
here is the digital circuit made up of interconnected funda- 
mental logic set elements that have been previously described. 

A graph with directed edges (arcs) is used to represent 
a circuit because it is often important to distinguish between 
inputs and outputs. An arc directed towards a vertex repre- 
sents an input to the corresponding functional element, and 
Similarly one directed away from a functional element repre- 
sents an output. 

Kodres {Ref. 10] warns, however, that it is inaccurate to 
use a digraph to represent a circuit by associating logic 
elements with vertices and interconnections with arcs. Fol- 
lowing this practice of letting an arc represent the connec- 
tion of distinct Logic elements there is no way to accurately 
represent the branching of one signal to several logic elements. 

Kodres' solution to this problem is to think of both 
logic elements and signals as nodes, and to represent the 
circuit with a bipartite graph which he calls a bi-digraph. 

A digraph is said to be bipartite if its vertices can be 
partitioned into two disjoint sets Vy and V5 in such a way 
that each arc has its initial vertex in one of the sets V, or 


Vor and its terminal vertex in the other set. Since logic 
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elements are connected with interposing signals such a 


partition exists, and a digital circuit can be represented 


as illustrated in Fig. 4. Figure 4a is a simple digital 


circuit, and Fig. 4b is its corresponding bi-digraph 


representation. 


Se 
S30 
@ 
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Figure 4b 
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The vertices of the bi-digraph corresponding to function- 
al logic elements of the digital “elreultvare called@iunc-er onal 
nodes of the bi-digraph. A signal between functional nodes 
is, however, a more complicated notion. One must be careful 
to note the differences between a Signal node of the bi- 
digraph and a signal of the bi-digraph. A signal between two 
functional elements is the combination of a signal node to- 
gether with the arcs incident to the node. In Fig. 4b, 

Signal node Sp together with the output arc from f, to s, and 


e) 8 


the input arc from Sp, to fi. constitutes a Signal from f, to 


5 
‘i 


6° . 
Figure 4b illustrates many features of the bi-digraph 
associated with a digital circuit. First it 1S important to 
note that although the bi-digraph has a bipartite structure, 
it is still a directed graph, and all of the relationships in 
the structure of a general digraph apply equally as well to 
a bi-digraph. But the characteristics of digital circuits 
are such that the bi-digraphs under analysis here are some- 

what simplified. 

It can be seen from Fig. 4b that functional nodes are 
internal in the sense that every functional node has at least 
one input signal and one output signal. This does not imply, 
however, that there are necessarily more signal nodes than 
functional nodes in a bi-digraph. ; 


Since an electronic package interfaces with other elec- 


tronic packages via Signals, the electronic connections to 
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the circuit represented by a bi-digraph are at signal nodes 
called input and output nodes. Input nodes are usually 
placed at the left of a bi-digraph, and output nodes at the 
ment. Ingkeg.,. 4b vertex Sig 1s seen to be both an output 
Signal node and an internal signal node. 

Because of the bipartite nature of a bi-digraph, there 
could never be a loop from and to the same functional element. 
All cycles must be composed of at least two arcs. Similarly 
there could never be two parallel or strictly parallel arcs. 
Parallel arcs represent distinct signals, but in the bi- 
digraph, distinct signals would pass through distinct signal 


nodes. Strictly parallel arcs would imply redundant inter- 


connections, which would be meaningless in a digital circuit. 


C. MATRIX REPRESENTATION OF A DIGRAPH 

There are various ways to use a matrix to represent the 
incidence relations of a graph's vertices and arcs. The two 
representations that are mentioned here are the adjacency 
matrix for representing undirected or directed graphs and 
Saesconnection matrix for representing bi-digraphs. 

Associated with each undirected or directed graph is an 
n xn adjacency matrix A where n is the total number of nodes 
in the graph. Row i of the adjacency matrix corresponds to 
vertex Xs of the graph and column j corresponds to vertex 
50 so that in the undirected case A(i,j) is equal to the 
number of edges incident with both vertex i and vertex Jj. 


For the directed graph the element A(i,j) is the number of 
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arcs directed from vertex 1 to vertex j. The adjacency 
Matrix 1s illustrated for a directed sorapiernie cao 
Associated with a bi-digraph is the n x m connection 
matrix C where n is the number of functional nodes and m is 


the number of signal nodes. The element C(i,j) of the 


Wi 


V3 





Figure 5 


connection matrix is defined by Kodres [Ref. 10] as follows: 
eat 8. 1S an input signal of f. 
C({i,j3) = 0 if 5. does not touch f. 
Sis is mabmomeputms monaliet f. 
Figure 6 shows the connection matrix associated with the bi- 


digraph of Fig. 4b. 
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III. THE PARTITION PROBLEM WITH FEEDBACK CONSTRAINTS 


A cycle of the bi-digraph corresponds to a feedback 
eyele in the digital circuit. Intuitively a feedback cycle 
1S a sequence of signals and functional elements such that an 
electronic pulse applied at one element in the sequence: could, 
after traversing the cycle, be detected at the element at 
which it was applied. 

For purposes of illustration, the bi-digraph in Fig. 7 
and its corresponding matrix in Fig. 6 will be used. The 


fe fe Ss 


1 See A oe! 
f, form a feedback cycle. Cycles in tne bi-digraph will 


henceforth be referred to as feedback cycles. Two feedback 


arcs joining the sequence of nodes f Ss 


Ze 


cycles are said to intersect if they share a common functional 


Or Signal node. 
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The problem of clustering functional elements of a 
digital circuit so as to minimize delay has pecne mace by 
Lawler et al [Ref. 12]. This effort considered the cases 
that the digital circuit was in che form of 1) a rooted tree, 
2) a multi-rooted tree, and 3) a directed graph without 
cycles. For the first two cases computationally efficient 
algorithms were developed, and for the third case an algo- 
rithm was developed which minimizes delay but has certain 
other drawbacks. However the problem of clustering the 
elements of a sickcatl circuit for the case that the circuit 
is in the form of a graph with cycles has not been solved. 

The purpose of this section is to derive a method to re- 
amece a digital circuit that te in the form of a cyclic graph 
to one that is in a form that can be processed by Lawler's 
algorithm. The approach taken here with respect to the de- 


scription of a digital circuit is different from that of 
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Lawler®s in that a digital cireurt any this pace enoee ome 
sented aS a bi-digraph. This will become more evident in 
Section IV in which the results of this section will be 
applied to provide an extension of Lawler's algorithm to 


cyclic networks. 


A. CLUSTERING THE FUNCTIONAL ELEMENTS 

This paper iS primarily interested in determing which 
functional elements of the bi-digraph (those which repre- 
sent elementary logic set elements) should be clustered into 
individual electronic packages. It would be helpful at this 
point to make more precise what is meant by clustering or 
coalescing the functional nodes of the bi-digraph. 

Given the results of an algorithm that indicates which 
functional elements should be coalesced into a package, the 
bi-digraph can then be changed to represent the digital cir- 
cuit in its clustered form. if; t@nsexanple, functional 


elements f £, and £, of Fig. 7 are to be coalesced, the 


Dee es 5 
resulting bi-digraph is shown in Fig. 8. The coalesced nodes 
are represented as being part of a single functional element 
which has been referred to as a "Super-node," and which is 
labeled by convention with the lowest label of the coalesced 
nodes that it represents. 

Since it is desirable to maintain the bi-digraph's bi- 
partite structure, some clarification is needed on how it is 


determined which signals are made to be internal to the super- 


node, and which are made to be external. If a signal node 
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is adjacent only with functional nodes that are to be coa- 
lesced, then the signal node is internal to the circuit 
represented by the resulting super-node. If a signal node 

1s part of a signal which ee between a functional 
element to be coalesced and functional elements which are not 
then a signal node is similarly acne ened with an arc that 
represents an input or output to the supernode in the coalesced 
version of the bi-digraph. This case is illustrated in Fig. 8 
in which S, and S, are input signals and Sp and Sg are output 
Signals to and from the resulting super-node. It is possible, 
however, that a signal node is both input and output to the 
functional nodes which are to be coalesced in the manner 
represented in Fig. 9 by node Si: Suppose functional nodes 

fy and f. are to be coalesced. In this case signal node Sy 

is connected to the resulting supernode by a bi-directional 
arc as illustrated by an edge in Fig. 10. This suffices to 


accurately represent the physical structure, but does cause a 
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Figure 9 


problem in that it introduces a bi-directional arc which must 
be distinguished. This can be easily done by using a special 
symbol in the connection matrix of the bi-digraph, and then 
algorithms, which are implemented on the computer, can be 
adjusted to treat the new arc as if it were pointing in both 


@mrections. 





Figure 10 
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B. FEEDBACK CONSTRAINTS 

The purpose of this section is to derive a method to 
reduce a bi-digraph with feedback cycles to one that is in 
a form that can be processed by Lawler's algorithm. To do 
this it 1S necessary to change the bi-digraph into one with- 
out feedback. This will be done by determining certain sub- 
sets of functional elements which when coalesced leave the 
bi-digraph free of directed cycles. 

156 Cc; and Cy are simple cycles of a digraph then let C 


il 
Ao the intersection of C. and Che Gentarin ENnese vaesm mel 


ke 


Lemma l: Let C. and Cc. be simple cycles of a 


vertices common to both Cc. and C 


Gigraph. If CVC, is nonempty, then 
the vertices of C, and C, are strongly 
connected. 

Proof: Suppose v, is any vertex on Cc; and VE 1s any 


k 


are both in the same simple cycle, from which it follows 


Wemtex on C.. Lf either v. or Vv. 1S in C.f/\C, then yv. and v 
k a 7 k 1 ke 


immediately that Vs and Vv, are strongly connected. 


Lf A and Vy are not in c.ac 


CNC. Since w is on Cir there is a path in each direction 


i then let w be a node in 


between Ve and w. Also since w and v, are bouhmenn Che there 


is a path in both directions between w and Vy Hence there 
is a path in both directions between Ve and ve containing w, 


and Vie and Vv, are strongly connected. Proof completed. 
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Lemma 2: Let Cis os and Cy be simple Veyvcles) Gfea 
digraph. If CNC, is Monon and CLANCS 
is nonempty, then the nodes of C, and C. 
are strongly connected. 
Proof: Suppose that Vv. and ue are nodes on Cc. and oa 
and that Ws is a node in CNC, and w, a node in SS 


k 
to Wa combined with cs from Ws to wa forms a path from v5 


Then that part of Cc. from V5 to Was combined with C, from We 


1s ‘aye A path can be similarly constructed from os to Vi: 
Hence Vv. and Ve are strongly connected, and Since the above 
can be constructed for any nodes V5 and iz in Cc; and Ss the 
lemma follows. Proof completed. 

From the above two lemmas and the fact that strong con- 
nectivity iS an equivalence relation, it is seen that strong 
connectivity is an equivalence relation among the vertices of 
a digraph that lie on at least one cycle. The equivalence 
classes of this equivalence relation are the vertices of the 
digraph that lie in strongly connected subgraphs. A lobe? 
L(v,) 1s a subgraph which contains exactly those vertices 
that are strongly connected to a vertex Vi: 

The major goal of this section was to determine certain 


subsets of functional elements which when coalesced leave the 


bi-digraph free of directed cycles. The above equivalence 


i This definition of a lobe is topologically equivalent 
to the definition of a lobe in Kamae [Ref. 9] but has been 
modified to suit the present discussion. 


relation, when applied to the set of functional elements of 
the bi-digraph which lie on feedback cycles, gives a method 
to determine which functional elements should be coalesced. 
The following lemma formalizes this result. 

Lemma 3: The digraph of lobes is free of cycles. 

Proof: Suppose L(v,) and L(v.) are distinct lobes and 
that there exists a cycle passing Mc Seen L(v,) and L(v.). 
Then the two vertices Vy and Vv, are Sereug aye ONMee rca. 
Therefore L(v,)=L(vo) which contradicts the assumption that 
L(v)) and L(v.) were distinct. Proof completed. 

What follows is the development of a computationally 


efficient method to generate the lobes of the bi-digraph 


represented by the matrix C in Figure 6. 


CeeeeRE DUCTION OF THE PROBLEM 

The first step of the algorithm is to reduce the problem 
of finding the lobes of the bi-digraph to one of finding the 
lobes of a corresponding digraph. From the bi-digraph, 
faerteten in terms of its n x m connection matrix C, 1S pro- 
duced the n x n matrix G. The rows and columns of G corre- 
spond to the functional elements of the bi-digraph (the rows 
of C). Gis produced in such’ a way that the vertices of G 
that are in a common lobe correspond to the functional 
elements of C that are ina common lobe. 

To do this the n x m matrices C4 amd Cy are defined. 


The rows and columns are labeled the same as C and the 


elements are defined as follows: 


Zu 


C_,(4,))=1 if and only 123647) ee 
C,(1,j)=1 1f and only if "eG se 


Then G is formed by the following matrix product: 


es : t 
CoCr 
where * is matrix multiplication, using boolean addition. 
Lemma 4: G(i,j) indicates the existence of a signal 


from f. Ae. * in the bi-digraph. 
. . wae Da. Lee 
Br@or: “Gia, 7) = ) C_, (1_k) C, (k,3) where ) inaglecates 
k=1 


boolean summation. c_, (1,3) = 1 if and only if there exists 


an arc from functional element f. to Signal node s evrGiae 


jet 


C, (kK, 3) = 1 if and only if there exists an are from s, to f 


k a 
Hence G(i,j) = 1 indicates that there exists a signal from f. 
to t. Vila some signal node Sys k = l724*,m. Proetveconolered: 


The matrix G for the graph in Figure 7 is shown in Figure 


aie, 





Figure 11 
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G can be considered as an adjacency matrix of a digraph 
whose vertices are the functional elements of the original 
bi-digraph, as“shown in Fig..12. The vertices ouseey will 


also be referred to as functional elements. 


Figure 12 


By the construction of G there is obviously a close 
relationship between the feedback cycles of C and the cycles 
eee. Lt will be shown that Chis relationship is such that 
the functional elements that lie in a lobe of C are also ina 
lobe of G. 

There are five representative ways in which two simul 
feedback cycles C. and os might intersect in C, as illus- 
iemaeed In Fig, 13, If, as shown in Fig. 13(a), Cc; BNE! KG 


J 
Share a common signal, then the corresponding cycles G; and G. 
of G will share a common arc. This is by the fact that there 
exists an arc from f. to fe if any only if there exists a 


Signal from f. to ee Lp ue Be 
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(2) 





(b) 








Figure 13 
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bi Cy and C. share a common input arc Or OUtpuUbeaveurn 
C, but do not share the entire signal, then G WS constructed 
as in Fig. l3(b) or 13(e)Q Since the dermis ten o.oo 
nal includes a signal node in combination with an input arc 
and an output arc, the signal between f. and f. 1s distinct 
from the signal between f, and f,. Hence the resulting cycles 
in G share a common vertex. 

it as ine Pigs. 3 (qd) Cc. and Cc, traverse a common functional 
node but do not otherwise intersect, then the structure of the 
corresponding cycles in Gis similar. 

Bastly, if Cc, and Cs Share a common signal node in C, 
then a number of intersecting simple cycles occur in G. This 
is illustrated in Fig. 13(e). 

The above five cases exhaust the possible ways in which 
cycles might intersect or traverse common nodes in C in the 
sense that any more complicated intersection of cycles could 
be broken down to multiple occurrences of these. By the 
resulting structure of G in each case, if two functional 
elements f. and S. are strongly connected in C then they are 
strongly connected in G. The reverse is also true. Namely, 
1f two functional elements f. and t. are strongly connected 
in G, then there exists paths from f. to fe and from f. ee 
f. in G, which by the construction of G means there exists 
Signals in both directions connecting f. and fy in C. Hence 


this theorem follows. 
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Theorem 1. For any two functional elements f, and fo, 


the conditaon Chait f, is in L(f,) holds 
in C if and only if it holds ine. 
Therefore by finding the lobes of G one determines which 


functional elements of the bi-digraph represented by C should 


be coalesced so as to leave C free of feedback cycles. 


D. GENERATION OF THE LOBES 

An easy way of generating the lobes of Gis by first 
generating the simple cycles of G. This can be done rather 
quickly due to an algorithm developed by Cochrane [Ref. 5]. 

Let P = { CyiCor tt tC, } be a maximal set of simple cycles 
of G that is formed in the following way. First let P contain 
any simple cycle of G. Then successively place into P all 
Simple cycles which intersect with any cycle previously placed 
into P. P is maximal in the sense that P contains all simple 
cycles that intersect with any cycle in P. 

ea CiUC, denote the strongly connected subgraph that 


is formed by the arcs and vertices of C, and C.,. Then by 


1 Z 
Lemma 1 and Lemma 2, UP = CJjUCLUC3*** UG, is a lobe of G. 
All of the lobes of G can be determined by finding UP, for 
each such set of simple cycles P. Ln “Ge 

The algorithm due to Cochrane to determine the simple 
cycles of a graph G is based on the concept of an attainability 
tree. The attainability tree T(v,) of a vertex Ve in Gis 


a graph that is produced in the following way. First place 


Vv; in T(v.). Then add to T(v;) all those nodes We of G that 


a\/2 








can be reached from we by a path of length 1, and connect 
each Mo ie V5 by an arc directed toward Vv; img T(v;). Then 
again determine in G which nodes are reachable from Vv; by 
directed paths, this time of length 2. These vertices are 
the same as those reachable from each Ws by paths of length 
l. Place these vertices in T(v;) with arcs directed toward 
the corresponding Bas Continuing in this manner for succes- 
Sively longer paths, the attainability tree T(v;) iS pro- 
duced with Me) at the root, and the branches directed toward 
v.. By traveling from a node of a branch of T(v,) to the 


1 
root, one traverses in reverse the nodes of a path from Nee 
in G. A branch of the attainability tree is terminated after 
the first occurrence of a node being repeated in that branch, 
which indicates that a simple cycle has been found. 

Using Fig. 12 as an example, the attainability tree 
T(f£ 5) of G, after all paths of length 4 from f. have been 


found, is illustrated in Fig. 14. On the next iteration, 





Figure 14 
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when considering paths of length 5 in G from f., the vertex 
f. is repeated in the right branch. At this point the 


presence of the simple cycle that traverses vertices f f 


72 
and fe 1s discovered, and that branch can be terminated in 


4 a 


the attainability tree. Hence, after this iteration the 
mietainablilitty tree 1S as in Fig? 15. 

By continuing to build the attainability tree T(f.), all 
Simple cycles of G will be discovered that are reachable from 
f.. The cycles are simple because they are discovered at 
the first occurrence of a repeated vertex. 

As can be seen in Fig. 15, the left branch of T(f.) will 


repeat the simple cycle fo, fy, fo on the next iteration; 





Figure 15 


hence the attainability tree as constructed can repeat 


Simple cycles. However, by first successively removing all 


source and sink nodes from G, the multiple occurrence of 
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Simple cycles in an attainability tree is greatly reduced. 

In order to produce the simple cycles of the entire graph, 

it is only necessary to produce attainability trees for 
vertices that have not appeared in previously produced attain- 
ability trees. 

Summarizing what has been said, the following is a de- 
Scription of the algorithm to find the simple cycles of G: 
Step 1. Successively remove all source and sink nodes of G. 
Piece step 1. 

Step 2. Produce the attainability tree T(f;) for some node 
f. that has not previously appeared in an Ret aia nee tree. 
Tiomeerme 4 vertex 1s added to the @eree, heck @the vertices 
of its branch for the duplication of a vertex, which would 
indicate a simple cycle. Terminate a branch of the tree as 
soon aS a Simple cycle is indicated in the branch. End Step 
2% 

Step 3. Record the simple cycles. If there are any vertices 
of G that have not yet appeared in an attainability EEee, 
teen return to Step 2. End Step 3. 

The goal of this section was to produce an algorithm to 
find the lobes of G, and thereby indicate which functional 
nodes of the bi-digraph should be coalesced. By adding the 
following two steps to the above algorithm, this has been 
accomplished. 

Step 0. From the connection matrix C of the bi-digraph, 
ie 


compute G = C_y ° Cy . Proceed to Step l. End Step Q. 
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Step 4. From the list of simple cycles from Step 3, deter- 
mine a maximal set of intersecting simple cycles 

1s { Cee, }. Find) aslob-wo saGcep: computingUP,. 
Repeat this step until all such sequences of cycles have been 
processed. 

E. COMMENTS ON THE IMPLEMENTATION OF THE ALGORITHM ON THE 

COMPUTER 

eerie ope ene reeneeeatien of storage, it is not 
efficient to store a graph in the computer in matrix form. 

In particular, the connection matrix of the bi-digraph and 
the adjacency matrix of the digraph that are used in the 
present algorithm are sparse matrices. Although there are 
several methods of storing a sparse matrix compactly, some 
inhibit the quick retrieval of information by requiring a 
program to perform several computations in order to determine 
the value of a particular entry. 

In the computer program which illustrates the algorithm 
developed here (page 68), the connection matrix of the bi- 
digraph has been somewhat modified. Using the connection 
matrix in Fig. 6, the functional nodes are. numbered J Enough 
7, and the signal nodes are numbered 8 through 18. Then the 
graph is stored in two lists. This is illustrated in lap 5 dL - 

The index of list N corresponds to a node of the graph, 
and the eee N(i) indicates the corresponding starting 
imidex Of list M. Starting at N(i), the entries of M indicate 


which nodes of the graph are connected from node 1 by a 
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N Il 2.3 4.5 6 7 8 9 10017 Wye See ole ae 
T2423 4 5 7.8 9 10 11°12 12S ai cme 20 


M Lo2)3 4-5 6 7 8 9 10 Ll 12 13 914 sioeiG 7 ese oe) 
iets 13 14 loko 1/7 Cee 4. 2 8 > 6 6317 sae 4 


Figure 16 


directed arc. The complete list of such nodes occupy M(N(i)), 
M(N(i)+1),+°+,M(N(it+t1)-1) in M. For example N(5) = 5 and 

N(6) = 7, which means that an arc issues from functional node 
f, to the nodes listed in M(N(5)} and M(N(5)+1) = M(N(6)), 
i1.e., to signal nodes 15 ARG how On allGmeo Lili eE Go mo). 

The above construction is applicable to both digraphs and 
bi-digraphs. In the case of bi-digraphs, as was the case in 
meg, 16 for the n xX m connection matrix of Fig. 6, the first 
n indices of list N correspond to the n functional nodes, and 
the last m indices of N correspond to the m signal nodes. 

The resulting saving in space is obvious, for to store a graph 
in the computer in this way, one needs only as many memory 
locations as there are arcs and nodes in the graph. 

It is possible to store the bi-digraph even more compactly 
by making further use of the bi-partite structure of the graph, 
and indicating the direction of arcs with a + or - sign as is 
done in the connection matrix. To do this the first list 
contains indices for only the functional nodes, and the second 
ies contains signal ape ecm ant the sign of the signal node 
indicating the direction of the arc (towards or away from the 


Signal node). This method of storing the graph, however, 
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makes it difficult to determine which functional nodes are 
connected to a given signal node. To obtain this information, 
a search of the entire length of the second list is required 
in order to determine each occurrence of the given signal 
node, and therefrom to determine an immediately connected 
functional node. 

The algorithm developed in this section is illustrated 
by the computer program on page 68. The program is divided 
into several logically independent sections. The program 
starts with the bi-digraph C stored in compact form. Then 
two list structures are built which reneatent the directed 
graph of functional nodes G, and which enable the quick 
removal of successive sink and source nodes. From the re- 
sulting structure of the graph the attainability trees are 
produced using a list Structure, and the simple cycles of 
the graph are discovered. Finally the simple cycles are 
combined as previously described in order to produce the 
lobes of the graph. These logical divisions in the program 
allow considerable overlaying of data structures from one 


part of the program to the next to conserve storage. 
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TV. EXTENSION OF THE CLUSTERING ALGORITHM TO CYCEteee i 


The partition problem is far from being solved in the 
general case. As mentioned previously, Lawler et al [Ref. 
12) have provided solutions to the partition problem for the 
ease an which the digital circulG 1s in the formcieaeneoued 
tree, or in the form of a multi-rooted tree. By applying the 
results of Section III, the functional elements which 1lie on 
feedback cycles can be coalesced to produce a graph without 
feedback. If this resulting graph has the form of one of the 
above two types of trees, then Lawler's algorithm can be 
applied to complete the solution to the problem. This assumes, 
of course, that each lobe of the original graph satisfies the 
constraints on the maximum number of logic elements and ex- 
ternal connections that can be associated with a single 
eluster® 

If the graph which results from coalescing the lobes 1s 
not in the form of a tree, but rather is in the form of a 
general acyclic graph, then Lawler's extended algorithm can 
be used, but only by allowing undesirable node replication. 
This node replication renders the algorithm unsatisfactory 
from a practical point of view. It is included primarily for 
theoretical reasons. 

Sections IV A, IV B and IV C briefly describe Lawler's 
results as applied to rooted trees, multi-rooted trees, and 
acyclic graphs. This presentation differs from Lawler's in 


that the application here is to the digital circuit expressed 
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as a bi-digraph, rather than a digraph, and as such several 
modifications have been made. Section V describes the 
problem of coalescing a subset of the functional elements of 
a tebe in the case that they donot fit withinea single 


cluster. 


moe CLUSTERING. ALGORITHM APPLIED TO RESULTANT ROOTED TREES 

Given a digital circuit expressed in the form of a bi- 
digraph, it is assumed that the algorithm developed in Section 
III has been applied and that the lobes of the bi-digraph, if 
there were any, have been coalesced into super-nodes. It is 
further assumed in this and in the following two sections that 
the lobes were within the space and external connection con- 
Straints for a single cluster. 

This section considers the case that the bi-digraph of 
Signal nodes, functional nodes, and super-nodes is in the 
form of a rooted tree, a admaceed graph with only a single 
source node (in-degree zero) or sink node (out-degree zero), 
and whose corresponding undirected graph 1S connected and 
free of undirected circuits. Functional nodes f. and f. rep- 
resent super-nodes formed by coalescing lobes of functional 
nodes in the original bi-digraph. The graph in Fig. 17 is a 
sink tree (all the arcs are directed toward the root). The 
procedure to be described works equally as well for source 
merees, with certain obvious modifications. 

The problem is to determine which functional elements of 


the bi-digraph should be coalesced so as to minimize delay 
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through the network. An external arc causes one unit of 


Gelay. Each coalesced group of functional elements must 
satisfy two further constraints: 1) the number of function- 
al elements must not exceed the bound M, and 2) the number of 
external connections must nee exceed P. This probiem can be 
formulated as that of labeling the nodes of the graph with 
integers. The functional nodes of the graph to be coalesced 
will be those functional nodes that are connected and that 
are assigned the same label. The label of the functional 
node will indicate the maximum delay through the circuit to 
that point, following the convention that external signals 
have a delay of one time unit and internal signals have neg- 
ligible delay. 

The labeling process can be described as follows. Asso- 


Cliate with each node of the graph a non-negative weight Ws 


4l 


which indicates the number of functional elements the node 
represents. Hence signal nodes have weight zero, functional 
nodes of the original bi-digraph have weight one, and super- 
nodes have weight equal to the number of functional nodes 
they represent. Then label the nodes of the sink tree 
eecording to the following rules: 


Rule 1. Nodes with in-degree zero (input Signal nodes) will 


be given the label 0. 


Rule 2. A node i is to be given a label at least as large 


as the largest label given to any of its predecessors. 


Rule 3. The sum of the weights of any given node, and all of 


its predecessors which have the same label, is not to 


exceed M, 


Rule 4. The total number of pin connections of a given node 
and all of its predecessors which have the same 


label is not to exceed P. 

Rule 4 needs some clarification. A cluster will be com- 
posed of functional elements with the same label. Sucha 
cluster is determined by finding a functional element no 
Baeeoassor Of which has the same labell. Then, forma cluster 
with that element and all of its predecessors that have the 
same label. 

An external connection is a Signal that communicates 
between nodes of a cluster and nodes not ina cluster. As 
Successors of a node are labeled, it must be determined that 
the total number of arcs that lie on external signals to the 
indicated cluster, is within the bound P. The indicated 


cluster is the cluster that will be formed by this group of 
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nodes with the same label. If P is exceeded, then the label 
assigned to any successors of the last node labeled must be 
labeled one greater. 

It is desired to produce an algorithm that labels the 
nodes of the graph according to the above rules, and such 
that the largest label given any node is as small as possible. 
The following notation from Lawler will help to describe the 
algorithm which accomplishes the desired labeling. 

For any subset of nodes S (both functional and signal) 
let p(S) equal the number of arcs with exactly one end point 
in S, and which lies on a Signal that communicates between a 
node of S and a node not in S. Hence p(S) indicates the 
number of pin connections due to external signals to and from 
S. Let R(i,k) denote the set of k-predecessors of node i, in 
other words, the set of nodes which are predecessors of node 
1, and which have the label k. Let w, (k) be the total weight 
of all of the k-predecessors of node il. 

Then the labeling algorithm developed by Lawler and 
applied to bi-digraphs in the form of a rooted tree ee 
LOlLlows: 

Step 0. Label all input nodes 0. 

Step 1. Find any unlabeled node i, all of whose predecessors 
have been labeled. Let k be the largest label applied to any 
of these predecessors. If DR Ge) e/a) ee ands ie we” + 
w, (k) < M, then assign label k to node i, else assign label 


k+l to node 1. 


Repeat Step 1 until all nodes have been labeled. 
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The result of this algorithm for M = 3 and P = 5 
applied to the bi-digraph of Fig. 17 is illustrated in Fig. 
18. The label applied to a node is enclosed in parentheses. 


The resulting clusters are indicated by broken lines. 
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According to the procedure that functional elements are 
to be placed in the same cluster if they are connected by a 
path and have the same label, nodes f, and f. each forma 


separate cluster. It should be noted that by pile aleve, ch, Systels! 


1 
f., in the same cluster, the constraints are not exceeded, and 
the delay through the circuit is not increased. Hence they 
are shown in Fig. 18 as being in the same cluster. The above 
algorithm works equally well for source trees as for sink 


trees by simply replacing each occurrence of the word 


"predecessor" with "successor," and each occurrence of the 
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word "sink" with "source," in the above discussion and algo- 
rithm statement. 

in Lawler's statement of this algorithm nodes were each 
given a weight of one. The concept of weighted nodes was 
defined to facilitate the extension of this algortenm te 
multi-rooted trees. This will be discussed in Section IV B. 
However, uSing the weight of a node to represent the number 
of functional elements that a node indicates, facilitates 
the application of the algorithm to the bi-digraph. This 
makes it easy to distinguish between Signal nodes, functional 
nodes, and Super-nodes which result from previous clustering. 
By including signal nodes in the labeling algorithm, such 
definitions as the set of k-predecessors of a node can be 
stated without the complication of having to distinguish be- 
tween different types of nodes. Since a Signal node is as- 
Signed weight zero, it is not necessary to treat Signal nodes 
separately when determining the total number of functional 
elements that are in a cluster. The weight of a Signal node 
is simply added to the previous total. Similarly, since a 
Super-node represents more than one functional element of the 
Original bi-digraph, the total number of functional nodes 
that it represents is easily obtained so that constraint (1) 


may be checked simply by addition of weight. 


B. EXTENSION OF THE ALGORITHM TO RESULTANT MULTI-ROOTED TREES 
If the bi-digraph which results from the coalescing of 


lobes 1S in the form of a multi-rooted tree (a tree with more 
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than one source or sink node), then the multi-rooted tree can 
be separated into a set of rooted trees to which the algo- 
rithm of Section IV A can be applied. The bi-digraph of 

Fig. 19 will be used to illustrate the procedure of this 
section. The weights of each functional node again indicate 
the number of functional elements of the original bi-digraph 
that are represented, and the weight of each signal node is 


Zero. 





Figure 19 


The first phase of the following algorithm separates the 
multi-rooted tree into a set of rooted trees that can be 
processed individually using the algorithm of Section IV A. 
This 1s done by temporarily removing certain arcs of the 


graph. The arcs removed are placed ina set A. The removal 
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of the arcs is done in such a way that each source-to-sink 
path in the multi-rooted tree passes through at most one of 
these arcs. 

The tree separation procedure is expressed as Steps 0 
and 1 of the clustering algorithm for multi~-rooted trees. 
Step 0. Choose any input signal node and all of its succes- 
sors as an initial source tree. Check each of the nodes of 
this tree. 

Step 1. Find an arc, exactly one end of which is checked, 
and place this arc in A. If the terminal (initial) vertex of 
this arc is the one checked, then the initial (terminal) 
vertex and all of its predecessors (successors) is choSen as 
a sink (source) tree. Check all of the nodes of this tree 
and repeat Step 1 as often as necessary. 

The result of the above two steps is shown in Fig. 20. 
Signal node s is chosen arbitrarily as the initial source 
node, and the arcs that are put in A are indicated by shading. 
It can be seen in Fig. 20 that any source-to-sink path passes 
through at most one arc in set A. The bi-digraph has been 
separated into three rooted trees, the source tree with root 
S, and two sink trees. 

The next step in the clustering algorithm is as follows: 
Step 2. Apply to each rooted tree the labeling procedure of 
Section IV A. For source trees the labeling procedure goes 
from sinks to source, and for sink trees the procedure goes 


hrEom Sources to Sink. 
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as Ina lev (jk), denote the signal from z to fy which 


includes arc ass where ae and fy are in F. Then the maximum 
delay along a path of the type completely contained ina 


Single cluster is D where 
D = max { kK. } 
\ f; eF 
where kK. is the label applied to functional node f.. The max- 
imum delay encountered along a path of the-type that includes 
an arc of A is D' where 
D' = max { Ks + Ky + 1 } 
all(j,2%). 
where the maximum is taken over all signals (Jd) associated 
with all arcs a; eee, wee Ay and k, are the labels assigned 


x 


to functional nodes = and f) respectively. Care in defining 
D' in terms of signals instead of arcs is required by the fact 
that the label, assigned to a Signal node, which might be one 
endpoint of an arc in A, might be one greater than Pei 
asSigned to the functional nodes which immediately precede it 
in the case of a sink tree, or which immediately succeed it 
in the case of a source tree. The delay which is associated 
with such a path is the same as the largest label assigned 
to a functional node of the path (not the signal node, which 
could have a label one greater). 

if DD" < D, Ehen™ ene clustering is optimal, since the max- 


imum delay of the circuit is along a path that is completely 


contained in a single rooted tree. If on the other hand 
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D' > D, it might be possible to decrease the maximum delay 
by one unit. 

Consider the arc a, e€ A, which is traversed by the path 
P of greatest delay. Since a, € A, arc a. is an external arc 
in the clustering indicated. Arc a. might be critical in 
the sense that if a; could be made internal to a cluster 
along path P, the maximum delay of the circuit might be re- 
@uced by one unit. It is possible, however, that if A is 
changed by removing as and by including some other arc, that 
another arc might become critical. The possibilities have 
been enumerated and analyzed by Lawler and the results are 
expressed in the following steps which complete the algorithm. 
The steps define a relabeling of the nodes of the graph to 
indicate clustering with minimal delay. The steps are sub- 
stantially the same as were expressed by Lawler et al [Ref. 
12}, except that certain definitions have been modified to 
account for the bi-digraph representation. Also the algo- 
rithm as stated here takes into account both the maximal 
weight constraint M and the external connection constraint P. 
meepr 3. Compute 

D = max { kK. } 
f; eF 


and compute 


+1 } 


| pa 
a nao katKy 


If D‘ < D, proceed to Step 6, else proceed to the re- 


labeling procedure, Step 4. 
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Step 4. Repeat this step for each rooted tree in turn in 
opposite order from that in which they were formed in Steps 
0 and 1. Assume the tree in question is a sink tree (modifi- 
cation for a source tree is straightforward). 

4a) Label all source nodes 0. 

4b) Find any unlabeled node i, all of whose predecessors 
have been relabeled. If this node is the sink node, pass to 
substep 4c below. lLet k be the largest label of any prede- 
cessor functional node, and k' be the largest label of any 
successor functional node not contained in the sink tree in 
question. Assign a new label to node 1 according to Table l. 
Repeat this. step as often as necessary. 

4c) In the Cues of a root node, let k be the largest 
label of any predecessor functional node. If Ww, tw, (k)< M, 
assign the label k to the node, otherwise assign the label 
k+l. If the value of this label is greater than D'=-1l, go to 
Step 6. | 
Step 5. Replace each label ks Obedimed) Her ede OUReCm tke man 
Step 4 by (D'-1)-k. The resulting labeling is optimal. End 
of procedure. | 
Step 6. Replace each label Kk obtained for a source tree in 
seep 2 by D-k,. This labeling is optimal. End of procedure. 

The results of applying this algorithm to the bi-digraph 
of Fig. 19 and Fig. 20 will be illustrated by Fig. 21 through 
23. Figure 21 shows the results of Step 2, the labeling 


procedure of Section IV A for M = 3 and P = 5. The clusters 
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Figure 21 


Peer idlecated In Pigg 2ieby sets of encsrcled nodes, —“Con- 


puting the value required in Step 3, it is found that 


D = max { Kk.) = 2 


fs eF 


and 


D' = max { Hs +1} = 3. 


g 
all(j,&); 


Since D' > D the relabeling procedure of Step 4 using the 
instructions of Table 1 is required to produce an optimal 
clustering. Following these instructions, the resulting 
labeling is illustrated in Fig. 22. Note that the arc which 


has weight 3 iS a super-node and has this weight due to the 
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clustering algorithm of Section III. The functional elements 
labeled f. and fi Which now each have weight 2 are not Super-_ 
nodes. Their weights were originally 1, and were increased 
by the instruction add Ww; '(k") to Ws in Table 1. This is 
required as a result of the relabeling scheme, since node = 


will now be included in the cluster with fir and £, with f,. 


yt k 
{In order to complete the relabeling algorithms for the third 
Sub-tree, it is necessary that the weights of these additional 


functional elements be used in determining the total weight 


of the cluster, 
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Step 4 1s now completed, and Step 5 is executed. This 
completes the relabeling process, and as can be seen in Fig. 


23, the maximum delay has been reduced in two. 
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C. COMMENTS ON A PROCEDURE FOR RESULTANT ACYCLIC GRAPHS 
Lawler's algorithm can be applied to a resultant acyclic 

graph that is not in the form of a tree, but only with limited 

success. The procedure that was applied to rooted trees can 

be used to produce a minimal cluster of an acyclic graph, 

but only with the added provision that a certain amount of 

node replication be allowed. Hence, if after the application 


of the algorithm of Section III the resultant acyclic graph 
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is not in the form of a tree (either rooted or multi-rooted), 
a satisfactory solution does not exist to complete the 
clustering. Although the procedure of Section III makes the 
algorithm applicable to a larger class of graphs, namely 
those graphs containing directed bycles, the clustering algo- 
rithm for resultant general acyclic graphs is still incom- 
plete. This section will first show how the algorithm for 
rooted trees has been applied to acyclic graphs, and will 
then give an example which illustrates the reason this appli- 
cation is unsatisfactory. 
The algorithm of Section IV is applied to acyclic graphs 

in the following way: 
Steps 0 and 1. Label the nodes as in the procedure for rooted 
Sink trees. 
Step 2. Locate each node of the graph that has the property 
that no successor of the node has the same label as the node 
Meselt. This node and all of its k predecessors form a single 
cluster. 
Step 3. Construct a new graph as follows: 

a) For each node of the original weary. create a 
node Jaq for each cluster gq in which node j is contained. 

b) For each arc ass from nodes i to j in the 
Original graph (one of these is a signal node and the other 
a functional node due to the bipartite structure), create an 


arc from node _ to ae where 
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1) p=qif i and j have the same label 
11) pis chosen arbitrarily, otherwise. 
The application of the above steps is illustrated in 


Fig. 24. In this example the result of Step 2 produces 


immediately a feasible clustering for M = 3, and in such a 


way that Step 3 produces an optimal clustering as is shown 


in Fig. 24a. Node replication is illustrated by signal node 


Sy in Fig. 24b. In this case, following Step 3 precisely 
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as stated, signal node Si LS Yeplicateadsanamnade Janpart of 
each cluster indicated tiisssrcceducom i « interpreted to 
mean that the same signal is applied at both occurrences 

of node Si: In the above case this poses no problem, since 
by the method of clustering nodes described in Section III AG 
Signal node Si would be external to both clusters, and would 
merely be common to the input signals to each cluster. 

But in more complicated structures node replication be= 
comes a major problem. For the graph in Fig. 25a, and the 
resulting structuring in Fig. 25b, excessive node replication 
renders the procedure aoe sacle, Note that not only are 


Signal nodes replicated, but so also are functional nodes. 


This usually requires extensive logic redesign. 
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V. COALESCING THE FUNCTIONAL ECEMEMIS Vile LOR: 


A. NEW CRITERIA OF CLUSTERING 

In the case o£ a digital circuit toy enouteteedbackt, FE eiss 
natural to define optimization criteria in terms of delay. 

The primary objective that seems most natural is EWerone 
stated in Sections I and III, namely, minimize the maximum 
delay caused by external arcs over the longer paths of the 
erECcu1t. 

This 1S not, in general, the case for the lobes of a 
circuit with feedback. Maximal paths are difficult to define, 
Since if a cycle is included among the arcs from one vertex 
to another, an electronic pulse traversing this sequence of 
arcs might traverse the cycle any number of times. It is 
certainly true, however, that in the final solution of the 
partition problem applied to the functional elements of a 
lobe, some definition of maximal paths will have to be made, 
and delay across them be considered. | 

The criteria by which the functional elements of a lobe 
feeioe COdlesced im this Section 1S Stated in terms Of a 
distinguished node of the lobe and the tracks of a lobe. 
Following Berge {Ref. 1], if u and v are nodes, u # v, a track 
from u and v is defined to be a path of minimum length going 
from u to v. Since a lobe 1s strongly connected, there always 
exists a track between any two vertices of a lobe. 

The distinguished node of the lobe 1s, intuitively, that 


node which represents an output from the lobe, and at which 


3) 





1£ is important, according to tie wileogsve Ot seiewerecul ime 
have a usable electronic pulse with minimal delay. It may 

be unrealistic to expect the logic designer to distinguish 

Just one node of a lobe as being the most important if the 

lobe is very large. Assume, however, that such a distinguished 
node can be identified. 

Considering a lobe in relation to the entire circuit, 
there might be several arcs which carry signals from function- 
al elements outside the lobe, to functional elements inside 
the lobe. The criteria of minimizing the delay through the 
entire circuit is still of prime importance here. In the 
case when a lobe satisfies the constraints on clustering, and 
the lobe is contained within a Single cluster, then the delay 
through the entire circuit 1s not increased by external con- 
nections within the lobe. But when the functional elements 
of the lobe must be placed:in more than one cluster, it is 
of interest to do this in such a way as to minimize the 
effect of inereased delay over the entire circuit. Therefore 
the distinguished node of the lobe would be that node which 
lies on an output signal of the lobe, and to which it is most 
important to minimize delay along a path from any node of 
the lobe to the distinguished node. 

Hence, the criteria by which the functional nodes of a 
lobe will be clustered is to minimize, for every node in the 
lobe, the delay between each node and the distinguished node. 


The algorithm to be produced, which will find a clustering 
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that satisfies this criteria, does not in any way depend on 
the physical characteristics of the distinguished node. 
Hence any node of the lobe could be chosen for this purpose. 
it would be~up tothe logqve designer ser ele computccr, = to 
determine the identity of the distinguished node based on 
the logic of the circuit or some graph theoretic property 


Cumeche Circuit. 


B. STATEMENT OF THE ALGORITHM 
For the purpose of describing the algorithm, the lobe in 


Fig. 26 will be used. Suppose that signal node s, is the 
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distinguished node. Then it is the goal here to indicate 
which functional elements are to be coalesced so that the 
delay along a track (path of minimal length) from any 


functional node to Sy Le Minimized. FOr MilusStration 1t wiles 
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be assumed that no more than three functional elements can be 
within a single cluster. 

To determine the cluster, a reverse attainability tree 
TCs 4) 1S constructed, with the distinguished node S, as the 
root. A reverse attainability tree is similar to an attain- 
ability tree, except that instead to placing on the branches 
those nodes which are attainable from the root, one places 
on the branches those nodes from which the root is attainable. 
The reverse attainability tree is built in exactly the same 
way aS was the attainability tree, except that instead of 
stepping along the paths of the graph from the distinguished 
node, determining the branch nodes which are reachable by 
paths of increasing length, one steps along the paths in the 
opposite direction than indicated by the arcs of the graph. 
The reverse attainability tree T (sy) for the lobe in Fig. 


foes shown in Fig. 27. 


As the reverse aeeamability tree iS) beimrembul leer & 
record is kept of the level at which each node first appears 
in the tree, the reverse attainability tree can be reduced 
to the tree in Fig. 28. This tree contains each node exactly 
once, and in each case the appearance of the node is on that 
branch which represents the shortest path from any node to Sy. 
If a node appears on different branches at the same level of 
the reverse attainability tree, then either occurrence can be 


used in producing the reduced tree. Hence by comparing the 


tree in Fig. 28 with the graph in Fig. 26, and by following 
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the corresponding paths from any node to Sar SINcesthis park 
is the path of minimal length from the node to Sar Hite | 2 Soa 
track in the original graph. Hence the tree in Fig. 28 
represents the tracks of Fig. 26. Designate this tree by 
Ti(s,), the tree of tracks from each node of the original 
lobe to the distinguished node Sy: 

Once TCs) is produced, the algorithm is completed by 
applying the labeling procedure of Section IV A for rooted 
Sink trees. By treating T, (sq) aS a bi-digraph, the function- 
al nodes are clustered in a way which minimizes the maximum 
delay through the tree. This has the effect of minimizing 
the maximum delay along any track in the original bi-digraph. 
Hence by clustering the nodes in this way the eee 1wO)1G 
clustering the functional nodes of the lobes of a bi-digraph 
has been met. 

The result of the labeling procedure is shown by the 


numbers in parentheses in Fig. 28, and the original bi-digraph 


is shown in Fig. 29 in its clustered form. In this example 


| nie bb aS) 








the weights of the functional nese were assumed tO bel. 

The maximum number of functional nodes per cluster is three, 
so M = 3, and any limitations on the number of external 
connections Sah be satisfied by simply checking the number 


of connections to a cluster in the original bi-digraph. 
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VI. CONCLUDING REMARKS 


The solution of the partition problem is far from being 
complete. In fact, at this point in time, even the statement 
of the problem is tenuous. Solutions are offered by investi- 
gators not yet with the intent of providing the complete 
answer, but only with the hope of Bereeing a new approach or 
a new insight, both to refine the definition of the problem, 
and to advance the solution to the problem as it is now stated. 

It is clear that the digital circuits represented in terms 
of graph theory are the digital circuits of computers yet to 
be designed, and that the algorithms that are to provide work- 
able solutions to the problems of design must be written with 
the intent that they will be programmed on the computer. The 
algorithms that have been presented io this thesis are such 
that they could be programmed. But much work has yet to be 
done, and much work is being done, on determining the best 
methods to represent a graph in the computer, and the best 
ways to implement algorithms which seem straightforward to 
humans, but not so to computers. 

The computer program which follows is meant to illustrate 
the results of Section III, the extraction of a lobe from the 
bi-digraph. It has been included not because it 1s expected 
that the data structures which are used would ever be used in 
aeelrcuit designing system. Rather it has been included be- 
cause the finding of maximal strongly-connected subgraphs is 
a problem in itself, which has many other applications 


besides its part in the partition problem. 
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Some other topics which are being investigated which 
closely affect the partition problem include: 
i} Finding an adequate solution for general acyclic 
networks. 
Li)” Finding alternates@prrmmizationcriterrasor the 
elustering algorithm. 
111) Determining the best ways to represent a graph in 


the computer, with emphasis on efficient graph manipulation. 
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